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Abstract — Delay tolerant Ad-hoc Networks leverage the mobil- 
ity of relay nodes to compensate for lack of permanent connectiv- 
ity and thus enable communication between nodes that are out of 
range of each other. To decrease delivery delay, the information 
to be delivered is replicated in the network. Our objective in this 
paper is to study a class of replication mechanisms that include 
coding in order to improve the probability of successful delivery 
within a given time limit. We propose an analytical approach that 
allows to quantify tradeoffs between resources and performance 
measures (energy and delay). We study the effect of coding on 
the performance of the network while optimizing parameters that 
govern routing. Our results, based on fluid approximations, are 
compared to simulations which validate the modelQ 

Index Terms — Forward correction, fountain codes, delay toler- 
ant networks 



I. Introduction 

Delay tolerant Ad-hoc Networks make use of nodes' mo- 
bility to compensate for lack of instantaneous connectivity. 
Information sent by a source to a disconnected destination 
can be forwarded and relayed by other mobile nodes. There 
has been a growing interest in such networks as they have 
the potential of providing many popular distributed services 
[1H3]. 

A naive approach to forward a file to the destination is by 
epidemic routing in which any mobile that has the message 
keeps on relaying it to any other mobile that falls within its 
radio range. This would minimize the delivery delay at the 
cost of inefficient use of network resources (e.g. in terms 
of the energy used for flooding the network). The need for 
more efficient use of network resources motivated the use of 
less costly forwarding schemes such as the two-hops routing 
protocols. In two-hops routing the source transmits copies of 
its message to all mobiles it encounters; relays transmit the 
message only if they come in contact with the destination. 
The two-hops protocol was originally introduced in [4]. 

In this paper we consider another aspect, i.e, the tradeoff 
between network resources and delay. We assume that the file 
to be transferred needs to be split into K smaller units: this 
happens due to the finite duration of contacts between mobile 
nodes or when the file is large with respect to the buffering 
capabilities of nodes. Such K smaller units (which we call 
chunks or frames) need to be forwarded independently of the 
others. The message is considered to be well received only if 
all K frames are received at the destination. 

'A version of this paper appears in the proc. of IEEE INFOCOM 2009. 



After fragmenting the message into smaller frames, it is 
convenient to better organize the way information is stored in 
the relay nodes. We aim at improving the efficiency of the 
DTN's operation by letting the source distribute not only the 
original frames but also additional redundant packets. This 
results in a spatial coding of the distributed storage of the 
frames. We consider coding based on either forward error 
correction techniques or on network coding approaches. Our 
main contribution is to provide a close form expression for the 
performance of DTNs (in terms of transfer delay and energy 
consumption) as a function of the coding that is used. Also, 
we derive scaling laws for the success probability of message 
delivery. 

The paper is organized as follows. In Sec. |T] we revise the 
state of art and outline the major contributions of the paper. 
In Sec. HI] we describe the model of the system and in Sec. [Til] 
we derive the main results for the case of erasure codes. 
Sec.|lV]is devoted to the analysis of fountain codes; for both 
cases, we discover and then study an interesting phenomenon 
of phase transition. Sec. iHll and HVl involve also the design of 
energy-aware forwarding policies where the source forwards 
packets with some fixed probability that we optimize. In 
Sec. [V] we study an alternative class of forwarding policies, 
namely threshold type policies, that achieve the same energy 
restrictions. The performances of the aforementioned coding 
techniques in the case threshold policies are then derived. 
Sec. [VI] reports on simulation results in case of synthetic 
mobility and real-world traces. A concluding section ends 
this paper. 

Related Works The idea to erasure code a message and 
distribute the generated code-blocks over a large number of 
relays in DTNs has been addressed first in [5] and [6]. The 
technique is meant to increase the efficiency of DTNs under 
uncertain mobility patterns. In [5] the performance gain is 
compared to simple replication, i.e. the technique of releasing 
additional copies of the same message. The benefit of erasure 
coding is quantified in that work via extensive simulations for 
various routing protocols, including two-hops routing. 

In [6], the case of non-uniform encounter patterns is ad- 
dressed, showing that there is strong dependence of the optimal 
successful delivery probability on the allocation of replicas 
over different paths. The authors evaluate several allocation 
techniques; also, the problem is proved to be NP-hard. 

General network coding techniques [7] have been proposed 
for DTNs. In [8] ODE based models are proposed under epi- 



2 



demic routing. Semi-analytical numerical results are reported 
describing the effect of finite buffers and contact times; a 
prioritization algorithm is also proposed. 

The work in [9] addresses the use of network coding 
techniques for stateless routing protocols under intermittent 
end-to-end connectivity. A forwarding algorithm based on 
network coding is specified, showing a clear advantage over 
plain probabilistic routing in the delivery of multiple packets. 

Finally, an architecture supporting random linear coding in 
challenged wireless networks is reported in [10]. 

Novel contributions 

The main contribution of this paper is the closed form 
description of the performance of Delay tolerant Ad-hoc Net- 
works under the two-hops relaying protocol when a message 
is split into multiple frames. Our fluid model accounts both 
for the overhead of the forwarding mechanism, captured in 
the form of a given bound on energy, and the probability of 
successful delivery of the entire message to the destination 
within a certain deadline. The effect of coding is included 
in the model and both erasure codes and fountain codes are 
accounted for in closed form. The two coding strategies are 
characterized in the case of static probabilistic forwarding 
policies and in the case of threshold policies. 

Leveraging the model, the asymptotic properties of the 
system are derived in the form of scaling laws. In particular, 
there exists a threshold law ruling the success probability 
which ties together the main parameters of the system. 

To the best of the authors' knowledge, the results contained 
in this work represent the first description in closed form of 
the behavior of erasure codes and fountain codes in challenged 
networks. 

II. The model 

For the ease of reading, the main symbols used in the paper 
are reported in Tab. Q] 

Consider a network that contains N + 1 mobile nodes. 
We assume that two nodes are able to communicate when 
they are within reciprocal radio range, and communications 
are bidirectional. We also assume that contact intervals are 
sufficient to exchange all frames: this let us consider nodes 
meeting times only, i.e., time instants at which a pair of not 
connected nodes fall within reciprocal radio range. 

Also, let the time between contacts of pairs of nodes be 
exponentially distributed with given inter-meeting intensity A. 
The validity of this model been discussed in [11], and its 
accuracy has been shown for a number of mobility models 
(Random Walker, Random Direction, Random Waypoint). @ 

We assume that the transmitted message is relevant during 
some time t. We do not assume any feedback that allows the 
source or other mobiles to know whether the messages has 
made it successfully to the destination within time r. 

2 We recall that studies based on traces collected from real-life mobility [2] 
argue that inter-contact times may follow a power-law distribution, but recently 
the authors of [12] have shown that these traces and many others exhibit 
exponential tails after a cutoff point. 



Symbol 



N 
K 
Al 

H 
A 

T 

Xi(t) 

X(t) 
X(t) 

£(*) 

x 
z 
e 

Ui(t) 

Pi 
V 

Di{r) 
Ps(r) 



Meaning 



number of nodes (excluding the destination) 

number of frames composing the message 

number of frames needed to decode with success probability 

1 — S, S > (fountain codes) 

number of redundant frames 

inter-meeting intensity 

timeout value 

number of nodes having frame i at time t (excluding the 

destination) 

sum of the Xi s 

sum of the X;s when = 1, Vi = 1, 2, . . . , K 

energy expenditure by the whole network in [0, t] 
maximum number of copies due to energy constraint 
:=X(0) 

energy per frame 
forwarding policy for frame i 

static forwarding policy for frame i; p = (pi,P2, • ■ ■ ,pjf) 
sum of the p;S 

probability of successful delivery of frame i by time r 
probability of successful delivery of the message by time r; 
P s (t, K, H) is used to stress the dependence on K and H 



TABLE I 

Main notation used throughout the paper 



The source has a message that contains K frames. If at 
time t it encounters a mobile which does not have any frame, it 
gives it frame i with probability Ui, and we let u = ^\ Ui < 1 
(we shall consider both the case where Ui depends on t and 
the case where it does not). For the message to be relevant, all 
K frames should arrive at the destination by time r. Let Xi(t) 
be the number of the mobile nodes (excluding the destination) 
that have at time t a copy of frame i. Denote by -Di(r) the 
probability of a successful delivery of frame i by time r. Then, 
given the process Xi (for which a fluid approximation will be 
used), we have 

A(r) = 1 - exp (-A f Q T X^ds) 

This expression has been derived in [13] for the fluid model. 

The probability of a successful delivery of the message by 
time r is thus 



A' 



1 — exp 



A J X % (s)ds 



where we assumed that the success probability of a given 
frame is independent of the success probability of other 
frames; this decoupling assumption is confirmed by our nu- 
merical experiments. 

A. Fluid Approximations 

Let X(t) = Yj%=i Then we introduce the following 

standard fluid approximation (based on mean field analysis) 
[14] 

dX t (t) 



(II 



Ui (t)X{N-X(t)) 



(1) 



Taking the sum over all i, we obtain the separable differential 
equation 

dX(t) 



dt 



u{t)X{N - X(t)) 



(2) 



3 



whose solution is 

X(t) =N + {z-N) e - x ti u< -^ dv , X(0) = z 
Thus, Xi (t) is given by the solution of 



dXjjt) 
dt 



Constant policies 



i(t)X(z- N)e~ x ^ u{v)dv (3) 



In the case of constant policies, we let Ui(t) = Pi, p := 
(pi,P2, ■ ■ ■ ,Pk), and p = J^iPi- Hence it follows 



X l (t)=X l (0) + (N~z)^[l-e- x P t ] 

p 1 J 



(4) 



Let us assume X;(0) = for V« = 1, 2, . . . , K: hence, 

X i (t)=N^(l-e- x * t ) (5) 

B. Taking Erasures into Account 

So far we have assumed that the transmission of a frame is 
always successful. Assume that this is not the case and that 
the transmission of a frame fails with some probability q. We 
assume that the process describing whether packets transmis- 
sions are successful or not is i.i.d. We assume moreover that 
a packet that suffers from unrecoverable transmission errors 
at a mobile is discarded so that it does not occupy memory 
space in the relay node; this ensures that such a mobile node 
can still act as a relay at the next meeting with another mobile 
having a packet to be transmitted. 

Losses such as those just described do not need an extra 
modeling: we may replace the rate A of inter-meetings between 
two nodes by A(l— q), i.e., the rate of the potentially successful 
inter-meetings of the nodes. This can be used in the equations 
that we derived in describing the dynamics of the system and 
its performance measures. 

An additional type of loss may occur at the destination: 
this is the case when it is not mobile and it is connected 
to an external network (possibly a wired one): in this case 
losses may occur in that part of the network. Assume that a 
loss there occurs with probability q' . We do not assume any 
feedback that would allow the DTN to know about events that 
occur at the external network. In order to be able to recover 
from such losses we assume that the destination may keep 
receiving copies of the same frame. In particular, a mobile 
that has transmitted a frame to the destination will keep the 
copy and could try to retransmit it to the destination at future 
inter-meeting occasions. This additional type of loss process 
does not alter the fluid dynamics of Xi(t). Its impact on the 
performance is by replacing A in (Q~|i by A(l — q 1 ). 



C. Non-constrained problem. 

The success probability when using p is 

K T 

P s {t,v) = TT I 1 CX P(~ A / Mv)dv) 



K 



l-e- Xpv )dv 



i=l 



where Z{pi) 
L{T,p) := 



1 — exp 



(L(r,p)pi 



and 



(6) 



(7) 



P 



1 — XpT 



-Apr 



For fixed ratios pi /p, P s (r, p) is increasing in p and is 
maximized at p = 1. 

Let Pg(r) be the optimal delivery probability for the 
problem of maximizing P s (r) with p fixed. The proof of the 
following can be found in the Appendix. 

Theorem 2.1: p* = (p/K, ...,p/K) is the unique solution 
to the problem of maximizing P s {t) s.t. 2~2iPi = P> Pi > 0. 

D. Constrained problem. 

Denote by £(t) the energy consumed by the whole network 
for the transmission of the message during the time interval 
[0, t]. It is proportional to X(t) — X(0) since we assume that 
the message is transmitted only to mobiles that do not have the 
message, and thus the number of transmissions of the message 
during [0, t] plus the number of mobiles that had it at time zero 
equals to the number of mobiles that have it. Also, let e > be 
the energy spent to forward a frame during a contact. We thus 
have £(t) = e(X(t) - X(0)). In the following we will denote 
x as the maximum number of copies that can be released due 
to energy constraint. 

We compute in particular the optimal probability of suc- 
cessful delivery of the message by some time r under the 
constraint that the energy consumption till time r is bounded 
by some positive constant. 

Define X(t) to be the solution of (ffjl when u(t) = 1, i.e. 



X{t) =N + (z-N)e 



Note that X(t) = X(pt). 

Denote u(z) := X (x + z) given X(Q) = z, which is 
the time elapsed until x extra nodes (in addition to the initial 
z ones) receive the message in the uncontrolled system. We 
have 



a(z) = -j\og 



N 



N 



For p — a(z)/r we obtain the expression 

N / , i .. .'• 

P 2 



N / / 
i(r, P ) = -(log(l 



N-zl N-z, 



(8) 



(9) 
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Theorem 2.2: Consider the problem of maximizing P s (t) 
subject to a constraint on the energy £(r) < ex. 

(i) If X(t) < x + z (or equivalently, r < <r(z)), then a control 
policy u is optimal if and only if pi — 1/K for all i. 

(ii) If X(u min T) > z + x (or equivalently, M m i n r > a(z)), 
then there is no feasible control strategy. 

(iii) If X(t) > z + x > X(u m - m T) (or equivalently, r > 
a(z) > u m i n r), then the best control policy is given by pi — 
p*/K where 

* <r(z) 



(10) 



and the optimal value is 

p:(t) = 



i 



i 



N ■ 



N 

-)^exp( 



N 



p*K N- 



K 



Proof: Part (i) and (ii) are obvious. Part (iii) follows from 
the fact that X(^-t) = ~X(a(z)) = x + z, so that the 

energy bound is attained for p* — also, the expression 

for P* (t) follows from an immediate application of the result 
in Thm.Oto 0- ■ 

III. Adding Fixed amount of Redundancy 

We add H redundant frames and consider the new message 
that now contains K + H frames. If at time t the source 
encounters a mobile which does not have any frame, it gives 
it frame i with probability p. 

Let S n . p be a binomially distributed r.v. with parameters n 
and p, i.e. 



P(S n , p =m) = B(p,n,m) := 



P m (i- P y 



The probability of a successful delivery of the message by 
time r is thus 

K+H 

P s (t,K,H)= B{Di(T),K + H,j), 

where D^t) = 1 — cxp(— A j Q Xi(s)ds). 

A. Main Result 

Lemma 3.1: The maximum of P s (t, K, H) over {p m > 

0,m = 1, ...,K + H} under the constraint 2~2i=i H Pi = P 
is achieved at p m = pj (K + H). 
The proof is delayed to the Appendix. 

Using the same arguments as those that led to Theorem [22] 
together with Lemma [3T1 yields the following: 

Theorem 3.1: Consider the problem of maximizing 
P s (t, K, H) over pi, i — 1, K + H, subject to a constraint 
on the energy £(t) < ex. 

(i) If X(t) < x + z (or equivalently, r < cr(z)), then a control 
policy u is optimal if and only if pi — l/(K + H) for all i. 

(ii) If X(u m i n r) > z + x (or equivalently, w m i n T > cr(z)), 
then there is no feasible control strategy. 



(iii) If X(t) > z + x > X(u m i n r) (or equivalently, 
t > a(z) > u m i n r), then the best control policy is given by 
Pi = p* /(K + H) where p* is given in ( fTOb and the optimal 
value is 



P* s {t,K,H) 



K+H 

Y, B(p,K + HJ) 

j=K 



where 



p=l-eML(P*,r)p*/(K + H)) 



B. Properties and Approximations 

We now derive further characterizations for the optimal 
success probability; these results will provide both bounds 
for the case of block coding and help also in the analysis 
of fountain codes. 

Corollary 3.1: P* (r, K, H) is increasing with H. 
Proof: We make the following observation. Fix K and 
H and a vector (pi, ...,pk+h) whose entries sum up to p* 
(given in dT0l>>. Then the success probability is the same as 
when increasing the redundancy to H + 1 and using the vector 
(pi, pk+h, 0). By definition, the latter is strictly smaller 
than P*(t, K, H+l) (which is the optimal success probability 
with H + l redundant packets). 

The above is in particular true when taking pi — 
p*/(K + H) (which maximizes P s (t, K, H)), and hence 
P* (r, K, H) < P* (r, K, H+l). ■ 

Now we provide two useful bounds for P* (r, H, K). 

First, it is possible to derive the asymptotic approximation 
of P* (t, H, K) for H — > oo. For the sake of notation, let 
V = L(p, r)p; the following holds 

P: (r, H, K) = e v J2 ( H + K Ue-—-l 



-K 



K-l 



For large values of H + K, the s-th term of the right-hand 
summation writes 



H + K\, -V 



(h + k)\ {-vy 

s J\H + KJ ~ s\{H + K ~ s)\{H + KY 

{-vy i (-vy 



1 - 



H+K 



where the Stirling formula applies, n\ ~ \Z2nn (j) n , with 
/ ~ g meaning lim//^ f/g = 1. 
Corollary 3.2: For K > 1 and r > 0, 



lim P:(r,H,K) = l-e 

H—>oo 



L(p,r)p 



K-l 

E 



-Hp, t))p\ 



We note that in sight of the monotonicity stated in Cor. 13.11 
the limit is reached from below. 
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K=25, N=300, T=3000 s 



K =10 



N=300, 1=3000 s 




50 100 150 200 250 

Number of redundant frames H 



Number of redundant frames H 



0.05 0.1 0.15 

Normalized number of frames K/N 



Fig. 1. a) Success probability for N = 300, K = 25, p = 1 under different values of A; Ai = 0.22 ■ lO -03 ^ 1 , A 2 = 0.091 ■ lO -03 ^ 1 A 3 = 
0.065 • 10 — 03 s _1 b) Success probability for Ai, N = 300, p = 1 under different values of K c) Upper and lower bound p = 1, S = 0.02 as given in 
Cor. 14. 1 1 for various values of N = 50, 125, 300, 500, 5000, versus normalized number of transmitted frames K/N. 



Also, the binomial bound [15, Thm 1.1] applies. For 1 < 
m < n — 1, and u > 1, define u through m = \upn\ . Then 



P(S n ,p >m)< 



,V2 



,1— upn / ^ 



(27rm(n-TO)) 1 /2 1 



V 



1 — up 



(l—up)n 



In Fig [T^) we reported the numerical comparison of the 
optimal delivery probability as a function of the number of re- 
dundant frames, for a particular setting. The asymptotic bound 
is reported as an horizontal dot-slashed line; the binomial 
bound is reported with slashed line. We notice that the bi- 
nomial bound proves a good approximation for lower success 
probabilities, i.e., at smaller values of A. The relative increase 
of the success probability under erasure codes is apparent: for 
instance, the success probability with A = 0.22 • 10 _03 s _1 
increases from 0.12 for H = to one with a few redundant 
frames (H = 12); the maximum attainable improvement, 
though, is dictated by the upper bound. 

In Fig 0J>) the optimal delivery probability is depicted as a 
function of the number of redundant frames, at the increase 
of K; even in this case the binomial bound proves a better 
approximation for lower success probabilities, as it appears in 
the case of K = 60. 

C. Phase transition 

In what follows we elaborate based on result from Thm. 13.11 
and we study the case of large values of TV. Let us assume 
that the total number of frames, H + K, grows as a function 
of N, i.e., H + K = (H + K)(N). 

The question we would like to answer is, in the asymptotic 
regime, what is the effect of redundancy onto the delivery 
probability, that is, how the number of redundant frames 
should scale with respect to the total number of frames. 

We assume throughout that the following limits exist: 

K := lim K(N)/N and H := lim H(N)/N 



N 



N- 



Notice that this implies of course that the energy constraint 
should grow at most linearly with N and we thus assume that 



the limit x := limAr^oo x(N)/N > exists 0. 
Proposition 3.1: Introduce the threshold 



To := Ar( 


'h 


X \ 


log(l-x)J' 


then, the following holds: 








1° 


if k + h < r 


limiv^oo P 3 {t,K,H) = < 





if K + H > T and K > T 




1 

V 


if K + H > T and K < T 



The proof is in the Appendix. We conclude that there exist a 
phase-transition effect. Its threshold Tq is the same as that we 
shall obtain later for the fountain codes. 

The way we interpret such result is that, in order to deliver 
with high probability the message, for large N, the K message 
frames should not exceed such threshold, but the sum of the 
encoded ones should. 

IV. Fountain Codes 

Each time the source meets a node, it sends to it (with 
probability p) a packet obtained by generating a new random 
linear combination of the K original packets. Using fountain 
codes, we know that for any 8 in order for the destination to be 
able to decode the original message with probability at least 
1 — 5, it has to receive at least M := K \og(K / 5) packets [16, 
Chap 50]. A useful expression for large K will be used later: 
if we write M = K(l + a) then a that guarantees that the 
destination can decode the original message with probability 
of at least 1 — S is given by 

_ (log(K/5))2 



K 

The number of packets that reach the destination during the 
time interval [0, r] has a Poisson distribution with parameter 
— L(r,p*)p* where p* is given in ( fTOt and where L(r,p*) is 
given in 



3 In what follows we will exclude the trivial case x - 
to the case when no relaying is allowed. 



0, which corresponds 
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The probability that less than M packets reach the destina- 
tion is given by 



Pm{t) 



^ (-L(T,p*)p*)< 

i=0 



exp(L(r,p*)p*) (11) 



We conclude that P*(t) >l-5- P m (t). 
Finally, we can leverage Cor. 13.21 and obtain 
Corollary 4.1: Given p* as in ( [Tol l. 

1-5- P m (t) < P;(r) < 1 - P m (t) 

We notice that the bound on the right-end of the inequality is 
obtained by using redundancy as in the previous section for 
H > M, then taking the limit of H — > oo. 

A. Numerical examples: a phase transition 

In Fig QJ) we reported the representation of the bounds de- 
scribed above: the two bounds provide a tight characterization 
of the performances of fountain codes; in particular, we notice 
that Pm(t), which in fact is the CDF of a Poisson r.v., tends 
to one as K increases: this causes the success probability to 
tend to zero as K increases. The intuition is that, for a large 
number of transmitted frames, the probability of receiving all 
of them successfully within the given deadline decreases faster 
than the gain obtained by adding redundancy through fountain 
codes. 

But, the numerical insight of the model says more of the 
performance attained by fountain codes. In fact, in Fig QJ) 
we reported P* versus the normalized number of transmitted 
frames K/N, for increasing N. Interestingly, the success 
probability becomes more and more close to a step function at 
the increase of N. We thus observe a phase transition: above 
a given number of transmitted frames, the success probability 
vanishes, and below the same threshold success occurs with 
probability one. 

We shall study this phenomenon analytically in the next 
subsection. 

B. Analysis Asymptotic behavior 

We now want to understand the behavior of the foun- 
tain codes for large values of N in the asymptotic regime 
K = K(N) < N; in what follows, we will assume that 

K := liniAr^oo K(N)/N exists. This implies of course that 
the energy constraint should grow linearly with N and we thus 
assume that the limit x := liniAr^oo x(N)/N exists. We can 

rewrite —L(r,p*)p* = N ■ where we obtain 



£(l-ApV-e-V' 



At 1 



JV- 



log(l 



N-z 



:= lim 



= At 1 



log(l - x) 



We have 



We notice that pjf \t) = Pr{X N < K(N)(l + a) - 1}, 
where Xm is a Poisson r.v. From the Strong Law of Large 
Numbers we have lini/v^oo Xjy/N = Tq P-a.s. Thus 



lim 



N 



lim Pr{X N > K(N)) 



1, if K > r 

o, if k < r 



from which we can deduce 



liriiAr^oo P s (t) 




if K<T 

if k > r 



where we used Cor. 14.11 

This demonstrates the effect of phase transition observed 
numerically in Fig [Tt). 

V. Threshold policies 

The way the energy constraints are handled so far is by 
using only a fraction of the transmission opportunities. This 
was done uniformly in time by transmitting at a probability 
that is time independent; we will refer to this strategy as static 
policy. In this section we consider an alternative way to dis- 
tribute the transmissions: we use every possible transmission 
opportunity till some time limit and then stop transmitting. 
This is motivated by the "spray and wait" policy [17] that is 
known to trade off very efficiently message delay and number 
of replicas in case of a single message. 

In addition we need to specify the values of pf. the proba- 
bility packet i when there is an opportunity to transmit (and 

the time limit has not yet elapsed). We have J^iLiPi = !• 

In the lack of redundancy, we recall that the success 
probability writes 



k r 

^(t,p) = TT( 1 - cx p(- a / X i 



(v)dv) 



Clearly, transmitting always when there is an opportunity 
to transmit is optimal if by doing so the energy constraint are 
not violated. This can be considered to be a trivial time limit 
policy with a limit of r = r. Otherwise, the optimal value r 
for a threshold is the one that achieves the energy constraints, 
or equivalently, the one for which the expected number of 
transmissions during time interval [0, r] by the source is x; 
as before, x is related to the total constraint on the energy 
through the constraint £(t) < ex. The value r of the time 
limit is then given by cr(z), see eq. (0. 

Since we considered a time limit policy, Xi(v) first grows 
till the time limit r is reached and then it stays unchanged 
during the interval (r, t] . For the non-trivial case where r = 
<j(z) we thus have: 



K{N){X+a)-l , 



E 

i=0 



cxp(-iV • 4 N) ) P s ( T ,p) = J] [l - exp ( - A f ^ NpJl - e^) 



dv 
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x=70, K-15,30, N.300, T.3000 s 



x=70, 8=0.05, N=300, T=3000,7000 s 



a) 




10 20 30 40 50 60 70 
Number of redundant frames H 




20 30 40 

Number of frames K 



Fig. 2. A comparison of success probability with erasure codes when using threshold and static policies; N = 300, x = 70, r = 3000, 7000: a) Erasure 
Codes, K = 15, 30 b) Fountain codes, 5 = 0.05. 



K 



\(r-a(z))XM*)) ) = no- 



where Z(pi) := 1-exp (piL(T)) , PiL(r) := -A J Q T Xi(v)dv 
and where we have by eq. © and ^ 



X t {<T{z)) = N Pl (l~e- x °^ 
Also, the integral can be expressed as 



Np t x 
N-z 



T(Z) Y < W N P 

Xi(v)dv = — 



N-z 



log 1 



V N-z) 



In particular, the following holds 



L(r) = ^— - + iV log ( 1 - ) - A(r - a( 2 ))- 



_/Va_ 



iV - z 



Ar - /3(a), 



N-z 



where 



m = - 



Nx 
N-z 



Finally, the pi's are selected to be all equal (and to sum to 
1) due to the same arguments as in the proof Theorem 12. II as 
here too, log(Z) is concave in its argument. 

In the case of redundancy, the calculations are similar to 
those in eq. (fT2l . where 

H+K / \ 

Ps(t,H,K)= J2 \z s (p i )(l-Z(p i )) N - s (12) 

s=K \ 



Hence, given H > 0, it holds 



^ z Xt + (3(z) 



H + K 



(13) 



H+K 

E 

s=K 



H + K 



exp 



H + K 



- 1 



Here again, we used the fact that the success probability is 
maximized for equal p^s, i.e. 




if t < a(z) 
if t > a(z) 



and P* (t, H, K) is given as in eq. (TT~3b . The proof follows 
the same lines as that of Lemma 13.11 

As a final remark, consider the trivial case where the energy 
constraint is not active: r < a(z), i.e., it is optimal to transmit 
all the x packets up to time t; in this case results from 
Thm.EDhold. 

Now we want to specialize the use of threshold policies 
in the case of fountain codes when the energy constraint is 
active: r > <r(z). In this case, the number of packets that reach 
the destination during the time interval [0, r] has a Poisson 
distribution with parameter A = —L(t) = -^tjAt + (3(z); the 



probability that less than M 
destination is given by 



Klog(K) packets reach the 



Pm{t) = exp 



Nx 



N- 



Xt - 0(z) 



M-l ^ 



Nx 



\\N 



Xr+P{z] 



and the statement of Cor. 14. II holds accordingly (notice that 
when the energy constraint is not active, we fall back to the 
original form of Cor 14. It . 

A. Comparison with static policies 

Here we would like to see what is the relative performance 
of static and threshold policies, both for fountain codes and 
erasure codes. In Fig. [2^) and Fig. [2J3) we reported on the case 
a bound on energy exists x — 70; in such case, a(z) = 1000 s. 
As concerns erasure codes, when the number of frames is high 
(30) the usage of a large number of redundant frames proves 
much more effective compared to static policies. Conversely, 
a for a lower number of frames, the advantage of threshold 
policies is less marked. 

In the case of fountain codes, threshold policies are again 
more efficient than static policies, and the effect is more 
relevant for large values of the time constraint r. We recall 
that we refer to optimal static policies. 



s 




Fig. 3. a) Simulation results for the success probability of erasure coded messages, static policy, H = 10, 20, N = 200, r = 80000 s, x = 70. b) Same as 
a) but under threshold policies c) Simulation results for the success probability of fountain coded messages, static policy, N = 200, r = 80000 s, x = 70, 
<5 = 0.02. 



VI. Numerical Validation 

In this section we provide a numerical validation of the 
model. Our experiments are trace based; message delivery is 
simulated by a Matlab® script receiving as input pre-recorded 
contact traces. 

Synthetic Mobility 

We considered first a Random Waypoint (RWP) mobility 
model [18]. We registered a contact trace using Omnet++ with 
N = 200 nodes moving on a squared playground of side 5 
Kms. The communication range is R = 15 m, the mobile 
speed is v = 5 m/s and the system starts in steady-state 
conditions in order to avoid transient effects [19]. The time 
limit is set to t — 80000 s, which corresponds roughly to 1 
day operations, and the constraint on the maximum number 
of copies is x = 70. With this first set of measurements, we 
want to check the fit of the model for the erasure codes and 
fountain codes. In the case of erasure codes, we fixed H = 10 
and 20 and increased the number of message frames K. We 
selected at random pairs of source and destination nodes and 
registered the sample probability that the message is received 
at the destination by time r. As seen in Fig. [3^) the fit with 
the model is rather tight and an abrupt transition from high 
success probability to zero is visible. Also, in Fig. [3})) we 
reported on the results obtained in case of threshold policies; 
the fit is similar to what obtained for static policies, confirming 
the gain of performance with respect to static policies. 

We repeated the same experiment in the case of fountain 
codes, as reported in Fig. [3};) for static policies; in this case 
the code specific parameter is S = 0.02 and again we increased 
K. Even in this case we see that the threshold effect predicted 
by the model is apparent. 

Real World traces 

Our model captures the behavior of a sparse mobile ad 
hoc network under some assumptions: the most stringent is 
the uniformity and the stationarity of intermeeting intensities. 
We would like now to understand what is the impact of non- 
uniform and non-stationary encounter patterns. We considered 
two sets of experimental contact traces: 



Haggle: in [2] and related works, the authors report exten- 
sive experimentation conducted in order to trace the meeting 
pattern of mobile users. A version of iMotes, equipped with 
a Bluetooth radio interface, was distributed to a number of 
people, each device collecting the time epoch of meetings 
with other Bluetooth devices. In the case of Haggle traces, 
due to the presence of several spurious contacts with erratic 
Bluetooth devices, we restricted the contacts to a subset having 
experienced at least 50 contacts, resulting in 19 active nodes. 

CN: the CN dataset has been obtained by monitoring 21 
employee within Create-Net and working on different floors of 
the same building during a 4-week period. Employees volun- 
teered to carry a mobile running a Java application relying on 
Bluetooth connectivity. The application periodically triggers 
(every 60 seconds) a Bluetooth node discovery; detected nodes 
are recorded via their Bluetooth address, together with the 
current timestamp on the device storage for a later processing. 

Fig E] a) and b) depict the results of experiments performed 
with these data sets. A major impact is played by the non- 
stationarity of traces. This is mainly due to "holes" appearing 
in the trace which impose unavoidable cutoff effects on the 
success probability. In particular, in the case of fountain 
coding, for K > 1, i.e., when the original file is actually 
fragmented, the performance is rather poor. This is due to 
the increase with K of the number of frames M required for 
decoding, exacerbated by the reduced size of the network. For 
example, for K = 3 and S = 0.05 it holds M = 13, i.e. M 
is close to N for the Haggle trace and larger than N/2 for 
the CN trace; this means that, in order to deliver the message, 
basically almost all nodes should meet the destination and 
deliver a frame within r. 

With erasure codes, both traces show the characteristic cut- 
off of performances at the increase of K. The decay, though, 
depends on the trace considered; in particular, a close look 
to the two data sets showed that the Haggle trace has higher 
average inter-meeting intensity compared to the CN trace over 
the considered interval; the shape seen in Fig @k) resembles 
more closely the theoretical sigmoid cutoff predicted by the 
model. 
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Fig. 4. a) Simulation results for the success probability of static policies, Haggle trace, N = 17, r = 80000 s, x = 17 and <5 = 0.05 b) Simulation results 
for the success probability of static policy, CN trace, N = 21, r = 80000 s, x = 17 and 5 = 0.05. 



VII. Conclusions and Remarks 

We have considered in this paper the tradeoff between 
energy and probability of successful delivery in presence of 
finite duration of contacts or limited storage capacity at a 
node: it can store only part (a frame) of a file that is to 
be transferred. To improve performance we considered the 
generation of erasure codes at the source which allows the 
DTN to gain in spatial storage diversity. Both fixed erasure 
codes (Reed Solomon type codes) as well as rateless fountain 
codes have been studied. 

Fountain codes can be viewed as special case of general 
network coding such as those studied in [7], [8], [10]. We note 
however that in order to go beyond fountain codes (in which 
coding is done at the source) and consider general network 
coding (where coding is also done in the relay nodes), one 
needs not only change the coding approach but one should 
allow storage of several frames at each relay node. Also, 
interesting hints come from the presence of non-stationary 
patterns arising in real world traces. New tradeoff issues arise 
which we shall study in future work. 
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VIII. Appendix 
A. Proof of Theorem \2.1\ 

Proof: The case p = is trivial so we assume below 
p > 0. We have 

K 

P*(r) = max P s (r) = max TT Z(pi) 

P'12iPi=P P'X^i Pi = P 7T 
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p* is optimal if and only if it maximizes <?(p) := 

E£i l°g(^(pi)) s.t. Y,iPi = P. Pi > 0. Note that L(t) is 
non-positive, log Z turns out to be a concave function. It then 
follows from Jensen's inequality that 



Notice also that, 



s(p) = ^i>g(^)) 

1=1 

H*X> 



< log 



) (14) 



where equality is attained for pi = p/K. This concludes the 
proof. ■ 



B. Proof of Lemma 15.71 

Proof: Let A(K,H) be the set of subsets h C 
{1, K + H} that contain at least if elements. For example, 

{1,2,..., If} e A(K,H). Fix pi such that Y%X H Pi = P- 
Then the probability of successful delivery by time r is given 
by 



P s (t,K,H) = 



E 

heA(H,K) ieh 



For any i and j in {1, if + H} we can write 

P s {t,K,H) = Z(p t )Z{ Pj ) gi + (Z(pi) + Z(p 2 )) 52 + g 3 

where g\, 52 and 53 are nonnegative functions of 
{Z(p m ),m ^ i, m j}. For example, 

9i= E II Z ( p ™) 

heA {i , j} (H,K) men 

where A V (K,H) is the set of subsets h C {1, ...,K+H} that 
contain at least if elements and such that v C h. 

Now consider maximizing P s (r, if, if) over and pj that 
satisfy p' = pi + pj for a given p' < p. Since Z(-) is strictly 
concave, it follows by Jensen's inequality that Z(pi) + Z(pf) 
has a unique maximum at pi = pj = p'/2. This is also the 
unique maximum of the product Z(pi)Z(pj) (using the same 
argument as in eq. (TBI), and hence of P s (r, if, H). 

Since this holds for any i and j and for any p' < p, this 
implies the Lemma. ■ 



C. Proof of Proposition li.il 
Proof: From Thm. 13.11 

P;(r, if, H) = P{S H+ K3 < H + K} - P{S H+K .p < K}, 
where p = 1 — cxp ( L ^^ p j ■ 

We can rewrite —L(r,p*)p* = N ■ where we obtain 



N-z 



l-(l-\p* T -e- Xp * T ) =Ar(l+- 
p*\ J \ l 0g (l-^ 



E[S H +K,p] = (H + K)p=(H + K)[l-e-- 



= (H + K) 



from which it follows that 



AT, 



(N) 



H + K 



lim E{S H+K , P } = ^ ^ = 

iV^OO N N^OD 

Notice that the binomial r.v. Sn+K.p can t> e interpreted 
as the sum of H + K independent binary random variables 
i n with mean p and variance p(l — p). Also, the series of 
the normalized variances of = — P) < is finite. 

Thus, the Strong Law of Large Numbers [20] ensures that 
liniAT^oo S H +K,p/N = r P-a.s. 

Now we can derive 

N->oc I 1, if K > 1 

and, in a similar way, 

N—>ac M it K > 1 



Finally, by enumeration, 



if if + # < r 



lim P s (t, K,H) = { if if + if > r and if > r 

N—*oo 

1 if if + if > T and if < T 

This concludes the proof. 



and we define 



